Deeply Tensor Compressed Transformers for End-to-End Object Detection
نویسندگان
چکیده
DEtection TRansformer (DETR) is a recently proposed method that streamlines the detection pipeline and achieves competitive results against two-stage detectors such as Faster-RCNN. The DETR models get rid of complex anchor generation post-processing procedures thereby making more intuitive. However, numerous redundant parameters in transformers make computation storage intensive, which seriously hinder them to be deployed on resources-constrained devices. In this paper, obtain compact end-to-end framework, we propose deeply compress with low-rank tensor decomposition. basic idea tensor-based compression represent large-scale weight matrix one network layer chain low-order matrices. Furthermore, gated multi-head attention (GMHA) module mitigate accuracy drop tensor-compressed models. GMHA, each head has an independent gate determine passed value. information can suppressed by adopting normalized gates. Lastly, fully compressed models, low-bitwidth quantization technique introduced for further reducing model size. Based methods, achieve significant parameter size reduction while maintaining high performance. We conduct extensive experiments COCO dataset validate effectiveness our (tensorized) experimental show attain 3.7 times full 482 feed forward (FFN) only 0.6 points drop.
منابع مشابه
DenseBox: Unifying Landmark Localization with End to End Object Detection
How can a single fully convolutional neural network (FCN) perform on object detection? We introduce DenseBox, a unified end-to-end FCN framework that directly predicts bounding boxes and object class confidences through all locations and scales of an image. Our contribution is two-fold. First, we show that a single FCN, if designed and optimized carefully, can detect multiple different objects ...
متن کاملEnd-to-end esophagojejunostomy versus standard end-to-side esophagojejunostomy: which one is preferable?
Abstract Background: End-to-side esophagojejunostomy has almost always been associated with some degree of dysphagia. To overcome this complication we decided to perform an end-to-end anastomosis and compare it with end-to-side Roux-en-Y esophagojejunostomy. Methods: In this prospective study, between 1998 and 2005, 71 patients with a diagnosis of gastric adenocarcinoma underwent total gastrec...
متن کاملVoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Accurate detection of objects in 3D point clouds is a central problem in many applications, such as autonomous navigation, housekeeping robots, and augmented/virtual reality. To interface a highly sparse LiDAR point cloud with a region proposal network (RPN), most existing efforts have focused on hand-crafted feature representations, for example, a bird’s eye view projection. In this work, we r...
متن کاملAffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images. Our AffordanceNet has two branches: an object detection branch to localize and classify the object, and an affordance detection branch to assign each pixel in the object to its most probable affordance label. The proposed framework employs three key components ...
متن کاملSaliency Guided End-to-End Learning for Weakly Supervised Object Detection
Weakly supervised object detection (WSOD), which is the problem of learning detectors using only image-level labels, has been attracting more and more interest. However, this problem is quite challenging due to the lack of location supervision. To address this issue, this paper integrates saliency into a deep architecture, in which the location information is explored both explicitly and implic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i4.20397